CLAIMS 



1. A method of determining regression functions from a 
computer data input, the regression functions for use in 
data mining, prediction, calibration, segmentation or 
response analysis, the method using JC-Harmonic Means 

regression clustering and comprising the steps of: 

selecting K regression functions /i>— >/*; 

associating an i-th data point from a dataset with 
a k-th regression function using a soft membership function; 

providing a weighting to each data point using a 
weighting function to determine a particular data point's 
participation in a calculation of a residue error; 

calculating a residue error between a weighted 
i-th data point and its associated regression function; 

iterating to minimize a total residue error; and 

identifying suitable regression functions for use 
in the analysis. 

2. The method of claim 1, wherein the soft membership 
function provides a probabilistic association between the 
i-th data point and the k-th regression function. 

3. The method of claim 1, wherein not all data points 
of the computer data input fully participate in each 
iteration. 

4. The method of claim 1, wherein the total residue 
error is described by a function in L q - space and the 

parameter q is greater than 2. 

5. A method of determining regression functions from a 
computer data input Z = ( X * Y> ) = 1 1 = N ) , the regression 
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segmentation or response analysis, the method using K- 
Harmonic Means regression clustering and comprising the 
steps of: 

selecting K regression functions /i»""»/jr f in an 
r-th iteration; 

associating an i-th data point from the dataset Z 

with a k-th regression function ^* using a soft probability 
membership function that can be expressed as, 



is a variable parameter; 

providing a weighting to each data point Zi using 
a weighting function that can be expressed as, 



to determine the data point's participation in calculating a 

residue error; 

calculating a residue error between a weighted 

i-th data point and its associated regression function; 

iterating to minimize a total residue error; and 
identifying suitable regression functions for use 

in the analysis. 

6. The method of claim 5, wherein the residue error is 
defined by an error function that can be expressed as 




where , 




and where q 




e{f k {x^yi)^fM)-y^ p 



, with P* 2 . 



7. The method of claim 5, wherein a regression 



optimization is used that satisfies, 



N 



//'> = arg nun 2>,<*)P(Z* I z,) " /(*,) - y, "« 




K 
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8. The method of claim 5, wherein the fC-Harmonic Means 
objective function is defined by replacing the MIN{) 
function of a K Means objective function with a harmonic 
average , HA ( ) , function . 

9. The method of claim 5, wherein iterations are done 
until the change in the objective function between 
iterations is less than a predetermined threshold. 

10. The method of claim 8, wherein the objective 
function can be expressed as: 

Perf RC _ KHMp (Z, M ) = £ HA { II /, (x,) - y, II' } = 



11. The method of claim 5, wherein the determined 
regression functions are used to predict outcomes from a new 
dataset . 



12. The method of claim 11, wherein the predictions 
from the K regression functions on a new data point are 
provided as K possible values each with an associated 
probability. 

13. The method of claim 5, wherein the determined 
regression functions are additionally used to initialize an 
Expectation Maximization regression clustering method. 

14. A system for determining regression functions from 

a computer data input Z = ( X > Y ) = 1 = L-.Af} , the system 

using K-Harmonic Means regression clustering and comprising: 

data input and storage means to receive and store 
the computer data input; 

a determined-regression- function display; 

a processor providing for: 
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selecting K regression functions fi>""fx , in 
an r-th iteration; 

associating an i-th data point from a dataset 

Z with a k-th regression function f* using a soft membership 
function; 

providing a weighting to each data point Zi 
using a weighting function to determine the data point's 
participation in calculating a residue error; 

calculating a residue error between a 
weighted i-th data point and its associated regression 
function; 

iterating to minimize a total residue error ; 

and 

determining suitable regression functions for 

output . 



15. The system of claim 14, wherein the system calls a 
regression analysis step as a subroutine from an existing 
program or library. 

16. The system of claim 14, wherein the soft membership 
function provides a probabilistic association between the 
i-th data point and the k-th regression function. 

17. The system of claim 16, wherein the soft membership 
function can be expressed as, 

ww/tw h ^ii/rv,)-*" P >2 „ h 

' ' =I where, * ' ' , H , and where q 

is a variable parameter. 

18. The system of claim 14, wherein the weighting 
function can be expressed mathematically as, 
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«,(2,)=z<'/i rf '' 



19. The system of claim 14, wherein the determined 
re g ress ion functions approximate sales or marketing data, 
provide economic demand curves, identify segmentation of 
customer responses, provide calibration parameters, or 
provide segmentation or interpolation of static or video 
images . 
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